Kodak consumer video benchmark data set : concept definition and annotation

نویسندگان

  • Alexander C. Loui
  • Jiebo Luo
  • Shih-Fu Chang
  • Dan Ellis
  • Wei Jiang
چکیده

Semantic indexing of images and videos in the consumer domain has become a very important issue for both research and actual application. In this work we developed Kodak’s consumer video benchmark data set, which includes (1) a significant number of videos from actual users, (2) a rich lexicon that accommodates consumers’ needs, and (3) the annotation of a subset of concepts over the entire video data set. To the best of our knowledge, this is the first systematic work in the consumer domain aimed at the definition of a large lexicon, construction of a large benchmark data set, and annotation of videos in a rigorous fashion. Such effort will have significant impact by providing a sound foundation for developing and evaluating large-scale learningbased semantic indexing/annotation techniques in the consumer domain. This report includes information about the concept definitions, the annotation process, video collection process, and the data structures used in the release file. The released dataset includes the annotations, extracted visual features (for videos from Kodak), and URLs of videos from YouTube. The Appendix section also includes the full list of concepts (more than 100 concepts in 7 categories) that have been defined in the consumer video domain.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Building a Large Annotation Ontology for Movie Video Retrieval

Multimedia content continues to grow rapidly. To ensure access to growing video collections, semantic indexing of images and videos has become a very important issue for data access, retrieval and actual application. For developing and evaluating semantic concepts searching annotation techniques, it is necessary to predefine a large lexicon, construction of a large benchmark data set, and annot...

متن کامل

VIREO-374: LSCOM Semantic Concept Detectors Using Local Keypoint Features

Semantic concept detection aims to rank video shots in large scale video corpus according to the presence of a specific concept, such as ``sports'', ``charts'', ``people marching'', and etc. In recently years, mainly motivated by the NIST TRECVID [1] which provides common video data and benchmark evaluation, a number of successful concept detection systems have been developed. As manually annot...

متن کامل

Robust Semantic Video Indexing by Harvesting Web Images

Semantic video indexing, also known as video annotation, video concept detection in literatures, has attracted significant attentions recently. Due to the scarcity of training videos, most existing approaches can scarcely achieve satisfactory performances. This paper proposes a robust semantic video indexing framework, which exploits user-tagged web images to assist learning robust semantic vid...

متن کامل

Online Multi-Label Active Learning for Large-Scale Multimedia Annotation

Existing video search engines have not taken the advantages of video content analysis and semantic understanding. Video search in academia uses semantic annotation to approach content-based indexing. We argue this is a promising direction to enable real content-based video search. However, due to the complexity of both video data and semantic concepts, existing techniques on automatic video ann...

متن کامل

ImageCLEF 2014: Overview and Analysis of the Results

This paper presents an overview of the ImageCLEF 2014 evaluation lab. Since its first edition in 2003, ImageCLEF has become one of the key initiatives promoting the benchmark evaluation of algorithms for the annotation and retrieval of images in various domains, such as public and personal images, to data acquired by mobile robot platforms and medical archives. Over the years, by providing new ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008